An improved speech transmission index for intelligibility prediction

نویسندگان

  • Belinda Schwerin
  • Kuldip K. Paliwal
چکیده

The speech transmission index (STI) is a well known measure of intelligibility, most suited to the evaluation of speech intelligibility in rooms, with stimuli subjected to additive noise and reverberance. However, STI and its many variations do not effectively represent the intelligibility of stimuli containing non-linear distortions such as those resulting from processing by enhancement algorithms. In this paper, we revisit the STI approach and propose a variation which processes the modulation envelope in short-time segments, requiring only an assumption of quasi-stationarity (rather than the stationarity assumption of STI) of the modulation signal. Results presented in this work show that the proposed approach improves the measures correlation to subjective intelligibility scores compared to traditional STI for a range of noise types and subjected to different enhancement approaches. The approach is also shown to have higher correlation than other coherence, correlation and distance measures tested, but is unsuited to the evaluation of stimuli heavily distorted with (for example) masking based processing, where an alternative approach such as STOI is recommended. 2014 Elsevier B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Acoustic Study of an Auditorium by the Determination of Reverberation Time and Speech Transmission Index

The quality of the communication between teachers and students and ultimately, of classroom education itself, is closely linked to the acoustic quality of the auditorium. This acoustic quality can be characterized based on the reverberation time (RT), speech transmission index (STI) and the sound insulation. In this context, an acoustic study was conducted in an auditorium located in the Higher...

متن کامل

Predicting speech intelligibility in conditions with nonlinearly processed noisy speech

The speech-based envelope power spectrum model (sEPSM; [1]) was proposed in order to overcome the limitations of the classical speech transmission index (STI) and speech intelligibility index (SII). The sEPSM applies the signal-tonoise ratio in the envelope domain (SNRenv), which was demonstrated to successfully predict speech intelligibility in conditions with nonlinearly processed noisy speec...

متن کامل

Improving the prediction power of the speech transmission index to account for non-linear distortions introduced by noise-reduction algorithms

Although the speech transmission index (STI) has been shown to predict successfully the effects of linear distortions introduced by filtering and additive noise, it does not account for non-linear distortions present in noise-suppressed speech. In this study, the normalized covariance metric (NCM), a STIbased intelligibility measure, was modified to reduce the effects of non-linear distortions ...

متن کامل

Objective prediction of speech intelligibility at high ambient noise levels using the speech transmission index

In many cases the intelligibility of speech in noise may be assumed independent of the absolute sound level; the speech-to-noise ratio (SNR) primarily determines intelligibility. However, at high sound levels, speech intelligibility is found to decrease. Subjective Speech Reception Threshold (SRT) measurements were performed at various speech and noise levels, and with various noise spectra. De...

متن کامل

Evaluating a distortion-weighted glimpsing metric for predicting binaural speech intelligibility in rooms

A distortion-weighted glimpse proportion metric (BiDWGP) for predicting binaural speech intelligibility were evaluated in simulated anechoic and reverberant conditions, with and without a noise masker. The predictive performance of BiDWGP was compared to four reference binaural intelligibility metrics, which were extended from the Speech Intelligibility Index (SII) and the Speech Transmission I...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Speech Communication

دوره 65  شماره 

صفحات  -

تاریخ انتشار 2014